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(54) Speech coding apparatus, speech decoding apparatus, speech coding and decoding method 
and a phase amplitude characteristic extracting apparatus for carrying out the method 



(57) A speech coding and decoding apparatus for 
improving the quality of synthesized speech. A coding 
portion (1) includes a filter (30) for adding a short-term 
phase amplitude characteristic to an excitation signal, 
and a coding circuit (29) for quantizing and coding a 
phase amplitude characteristic. A decoding portion (2) 



1: CODING PORTION 
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includes a decoding circuit (31) for decoding the coded 
phase amplitude characteristic, and a filter (32) for add- 
ing the same phase amplitude characteristic as that in 
the coding portion. Thus, it is possible to synthesize high- 
quality speech with good reproducibility of the phase 
amplitude characteristic of an excitation signal. 
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Description 

BACKGROUND OF THE INVENTION 

Field of the Invention 

The present invention relates to a code-excited lin- 
ear prediction speech coding apparatus for compressing 
and coding a speech signal into a digital signal, a code 
driving linear prediction speech decoding apparatus for 
decoding the compressed signal, a speech coding and 
decoding method and a phase amplitude characteristic 
extracting apparatus which is available for this method. 

Description of the Prior Art 

Fig. 7 shows the overall structure of an example of 
a conventional code-excited linear prediction speech 
coding and decoding apparatus which is shown in 
"Improved Speech Quality and Efficient Vector Quanti- 
zation in SELP" by W. B. Kleijn, D. J. Krasinski, R. H. 
Ketchum (ICASSP 88, pp. 155 to 158. 1988). 

This apparatus includes a coding portion I, a decod- 
ing portion 2, a multiplexing means 3 and a separating 
means 4. Input speech 5 is input to these elements and 
output therefrom as output speech 6. This apparatus fur- 
ther includes a linear prediction parameter analysis 
means 7, a linear prediction parameter coding means 8, 
and synthesis filters 9, 18. Adaptive codebooks 10, 14, 
random codebooks 11, 15, and an optimum code 
searching means 12 constitute an excitation signal gen- 
erating means. The gains of codevectors are coded by 
an excitation gain coding means 13. The decoding por- 
tion 2 includes an excitation gain decoding means 16 and 
a linear prediction parameter decoding means 1 7. 

The operation of the conventional code-excited lin- 
ear prediction speech coding and decoding apparatus 
will now be explained. 

In the coding portion 1. the linear prediction param- 
eter analysis means 7 first extracts a linear prediction 
parameter by analyzing the input speech 5. The linear 
prediction parameter coding means 8 then quantizes the 
linear prediction parameter, and outputs the code corre- 
sponding to the parameter to the multiplexing means 3 
and the quantized linear prediction parameter to the syn- 
thesis filter 9. 

The adaptive codebook 10 stores excitation signals 
which have been obtained and outputs an adaptive vec- 
tor which corresponds to an adaptive code L input from 
the optimum code searching means 12. The random 
codebook 1 1 stores N random vectors which are pro- 
duced from random noise, for example, and outputs a 
random vector which corresponds to a random code I 
input from the optimum code searching means 12. The 
synthesis filter 9 generates synthesized speech by using 
the quantized linear prediction parameter and an excita- 
tion signal which is obtained by adding the adaptive vec- 
tor and the random vector which are multiplied by 
excitation gains p and y, respectively. 



The optimum code searching means 12 evaluates 
the perceptual weighted distortion constituting a residual 
signal between the synthesized speech and the input 
speech 5, obtains the adaptive code L. the random code 
5 I and the excitation gains p and y which minimize the dis- 
tortion, and outputs the adaptive code L and the random 
code I to the multiplexing means 3 and the excitation 
gains p and y to the excitation gain coding means 13. 
The excitation gain coding means 13 quantizes the exci- 
te tation gains p and y and outputs those codes to the mul- 
tiplexing means 3. 

The adaptive codebook 10 updates the contents of 
the codebook 1 0 by using the excitation signal generated 
by using the adaptive vector corresponding to the adap- 

is tive code L, the random vector corresponding to the ran- 
dom code I and the quantized excitation gains p and y 
which minimize the distortion. 

As a result of the above-described operation, the 
multiplexing means 3 supplies the code which corre- 

20 sponds to the quantized linear prediction parameter, and 
the codes which correspond to the adaptive code U the 
random code I and the excitation gains p and y to a trans- 
mission path. 

The operation of the decoding portion 2 will now be 

25 explained. 

The separating means 4 which receives the outputs 
from the multiplexing means 3 separates the outputs and 
transmits the supplied adaptive code L to the adaptive 
codebook 1 4, the random code I to the random codebook 

30 1 5, the codes of the excitation gains p and y to the exci- 
tation gain decoding means 1 6, and the code of the linear 
prediction parameter to the linear prediction parameter 
decoding means 1 7. 

The adaptive codebook 1 4 outputs the adaptive vec- 

35 tor which corresponds to the adaptive code L. and the 
random codebook 15 outputs the random vector which 
corresponds to the random code I. The excitation gain 
decoding means 16 decodes the excitation gains p and 
y and as to multiply the adaptive vector by the gain p and 

40 the random vector by the gain y. 

The linear prediction parameter decoding means 17 
decodes the linear prediction parameter which corre- 
sponds to the code of the linear prediction parameter and 
outputs the decoded linear prediction parameter to the 

45 synthesis filter 1 8. The synthesis filter 1 8 synthesizes an 
excitation signal which is obtained by adding the adap- 
tive vector and the random vector by using the linear pre- 
diction parameter, and outputs the output speech 6. 
The adaptive codebook 14 updates the contents of 

so the codebook by using the excitation signal in the same 
way as the adaptive codebook 1 0 of the coding portion 1 . 

Another coding and decoding apparatus is shown in 
Fig. 8. 

Fig. 8 shows an apparatus having coding and 
55 decoding means for coding and decoding the phase 
characteristic of an excitation signal which is shown in 
"Speech Coding Using All-pass Filter Response" by 
Ikeda, Nakamura and Asada (Technical Reports of the 
Institute of Electronics, Information and Communication 
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Engineers SP 91 -72, pp. 45 to 52, 1991). The structure 
of this apparatus is different from that of the apparatus 
shown in Fig. 7 in that the former further includes pulse 
train generating means 19. 25, phase characteristic 
codebooks 20. 26, phase characteristic adding filters 21 , 5 
27, an optimum excitation • phase characteristic search* 
ing means 22. a pulse position coding means 23 and a 
pulse position decoding means 24. 

In the coding portion 1 , the pulse train generating 
means 19 outputs a pulse train which corresponds to the 10 
position of the head pulse and the pulse interval which 
are input from the optimum excitation ■ phase character- 
istic searching means 22. The phase characteristic add- 
ing filter 21 is, for example, an N-order all-pass filter 
whose transfer function H(z) is represented by the fol- is 
lowing formula (1): 

H(2)=^ (1) 20 

The phase characteristic codebook 20 stores a 
plurality of filter coefficients which are created on the 
assumption that the impulse response of the phase char- 
acteristic adding filter 21 , for example, is given as a ran- 
dom sequence of numbers, and outputs the filter 
coefficient which corresponds to the code input from the 
optimum excitation ■ phase characteristic searching 
means 22 to the phase characteristic adding filter 21. 
The phase characteristic adding filter 21 adds a phase 
characteristic by using the filter coefficient to the excita- 
tion signal which is obtained by multiplying the pulse train 
output from the pulse train generating means 19 by an 
excitation gain g mission, by using the filter coefficient, 
and outputs the phase characteristic added excitation 
signal to the synthesis filter 9. The synthesis filter 9 gen- 
erates synthesized speech by using the quantized linear 
prediction parameter which is input from the linear pre- 
diction parameter coding means 8 and the excitation sig- 
nal to which the phase characteristic is added. 

The optimum excitation ■ phase characteristic 
searching means 22 obtains the position of the head 
pulse and the pulse interval of the pulse train, the exci- 
tation gain g and the code of the phase characteristic 
which minimize the perceptual weighted distortion of a 
residual signal between the synthesis speech and the 
input speech 5. and outputs the position of the head 
pulse and the pulse interval of the pulse train to the pulse 
position coding means 23. the excitation gain g to the 
excitation gain coding means 13, and the code of the 
phase characteristic to the multiplexing means 3. 

The pulse position coding means 23 quantizes the 
position of the head pulse and the pulse interval of the 
pulse train and outputs the codes to the multiplexing 
means 3. 

The multiplexing means 3 which has received these 
codes transfers the code which corresponds to the linear 



prediction parameter, the code of the phase character- 
istic, the codes which correspond to the quantized posi- 
tion of the head pulse and the pulse interval of the pulse 
train, and the code corresponding to the quantized exci- 
tation gain g to the separating means 4. 

The operation of the decoding portion 2 will now be 
explained. 

The separating means 4 which has received the out- 
puts of the multiplexing means 3 outputs the codes which 
correspond to the quantized position of the head pulse 
and the pulse interval of the pulse train to the pulse posi- 
tion decoding means 24, the code of the excitation gain 
g to the phase characteristic codebook 26, and the code 
of the linear prediction parameter to the linear prediction 
parameter decoding means 17. 

The pulse position decoding means 24 decodes the 
position of the head pulse and the pulse interval which 
correspond to the codes of the position of the head pulse 
and the pulse interval of the pulse train and outputs the 
decoded position and pulse interval to the pulse train 
generating means 25. The pulse train generating means 
25 outputs the pulse train which corresponds to the posi- 
tion of the head pulse and the pulse interval to the phase 
characteristic adding filter 27. 

The excitation gain decoding means 1 6 decodes the 
excitation gain g which corresponds to the code of the 
excitation gain. The phase characteristic codebook 26 
outputs the filter coefficient which corresponds to the 
code of the phase characteristic to the phase character- 
istic adding filter 27. 

The phase characteristic adding filter 27 adds the 
phase characteristic to the excitation signal which is 
obtained by multiplying the pulse train by the excitation 
gain g, by using the filter coefficient, and outputs the exci- 
tation signal obtained to the synthesis filter 18. The syn- 
thesis filter 18 outputs the output speech 6 by using the 
linear prediction parameter which is input from the linear 
prediction decoding means 17 and the excitation signal 
with the phase characteristic added thereto. 

A conventional apparatus for obtaining the short- 
term phase amplitude characteristic of the linear predic- 
tion residual signal of speech is shown in Fig. 9. This is 
an apparatus described in "Speech Encoding Based on 
Phase Equalization** by Honda and Moriya (Transactions 
of the Committee on Speech Research The Acoustical 
Society of Japan $84-05, pp. 33 to 40, 1984). 

In Fig. 9. speech is input as input speech 101 , and 
a phase amplitude characteristic 102 is obtained. This 
apparatus includes a linear prediction parameter analy- 
sis means 103, a linear predictive inverse filter 104, a 
pitch extracting means 105, a pitch position extracting 
means 106, and a phase amplitude characteristic adding 
filter coefficient calculator 1 07. 

The process for obtaining the short-term phase 
amplitude characteristic of the linear prediction residual 
signal of speech will be explained. 

When the input speech 101 is input, the linear pre- 
diction parameter analysis means 1 03 analyzes the input 
speech 101 so as to extract the linear prediction param- 
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eter and outputs the extracted linear prediction parame- 
ter to the linear predictive inverse filter 104. The linear 
predictive inverse filter 1 04 generates a linear prediction 
residual signal from the input speech 101 by using the 
linear prediction parameter, and outputs the linear pre- 
diction residual signal to the pitch position extracting 
means 106 and the phase amplitude characteristic add- 
ing filter coefficient calculator 107. 

The pitch extracting means 105 extracts the pitch 
period of the input speech 101 by a known method and 
outputs the extracted pitch period to the pitch position 
extracting means 106. The pitch position extracting 
means 106 extracts the pitch position at every pitch 
period as the position at which the linear prediction resid- 
ual signal has the maximumjn amplitude in one pitch 
period, and outputs the pitch position to the phase ampli- 
tude characteristic adding filter coefficient calculator 
107. 

The phase amplitude characteristic adding filter 
coefficient calculator 1 07 obtains the function of a phase 
amplitude characteristic adding filter (Fig. 10) having an 
impulse response which outputs the linear prediction 
residual signal when a pulse train, in which pulses exist 
only at pitch positions, is input, and outputs the function 
as the phase amplitude characteristic 102. The phase 
amplitude characteristic adding filter is, for example, an 
N-order filter whose transfer function H(z) is represented 
by the following formula (2). 

N 

(2) 

Alternatively, the phase amplitude characteristic adding 
filter may be, for example, an N-order all-pass filter 
whose transfer function H(z) is represented by the for- 
mula (1). 

The above-described prior art has the following 
problems. 

Speech is composed of voiced speech and unvoiced 
speech. The reproducibility of voiced speech exerts a 
great influence on the quality of synthesized speech. It 
is possible to model the excitation of a voiced sound in 
the form of a signal having a pitch periodicity and a short- 
term phase characteristic in the pitch periodicity. 

In the conventional code-excited linear prediction 
speech coding apparatus, the excitation signal is repre- 
sented by the sum of an adaptive vector and a random 
vector. This method does not directly represent the 
phase characteristic of the excitation signal. Therefore, 
there is a case in which the phase characteristic of the 
excitation signal is not reproduced, which leads to adete- 
rioration of the quality of synthesized speech. 

This problem is serious, for example, at a transitional 
portion from unvoiced speech to voiced speech or at a 
voiced speech where the pitch period changes greatly. 
At such a portion, an adaptive vector does not ade- 
quately work so that it is necessary to reproduce the pitch 
period and the phase characteristic using only the ran- 
dom vector. 



In the conventional coding and decoding apparatus 
for coding the phase characteristic of an excitation sig- 
nal, although the phase characteristic of an excitation 
signal is coded, since an excitation signal is assumed to 

5 have a simple pulse train, when an appropriate phase 
characteristic is not found in the phase characteristic 
codebook, it is impossible to complete the phase char- 
acteristic using an excitation signal, which leads to a 
deterioration of the quality of synthesized speech. 

w In the case of adopting the conventional method of 
obtaining the short-term phase amplitude characteristic 
of the linear prediction residual signal of speech, 
although it is necessary to obtain the pitch period and 
the pitch position, since it is not always possible to obtain 

is the exact pitch period and pitch position, the difference 
between the phase amplitude characteristic obtained 
from the inexact pitch period and pitch position and that 
obtained from the exact ones will increase according to 
the degree of the error. 

20 

SUMMARY OF THE INVENTION 

Accordingly, it is an object of the present invention 
to eliminate the above-described problems in the prior 
25 art and to provide a code-excited linear prediction 
speech coding and decoding apparatus and a speech 
coding and decoding method which can avoid a deteri- 
oration in the quality of synthesized speech and generate 
synthesized speech having a good quality. 
30 To achieve this end, in a first aspect of the present 
invention there is provided a speech coding apparatus 
comprising: a linear prediction parameter analysis 
means; a linear prediction parameter coding means; an 
excitation signal generating means; a synthesis filter for 
35 synthesizing the output signal of the linear prediction 
parameter coding means and the excitation signal output 
from the excitation signal generating means; a phase 
amplitude characteristic coding means for quantizing 
and coding the phase amplitude characteristic which is 
40 obtained by analyzing the linear prediction residual sig- 
nal of an input speech signal; and a phase amplitude 
characteristic adding filter for adding a short-term phase 
amplitude characteristic to the excitation signal. 

According to this structure, the short-term phase 
45 amplitude characteristic of an excitation signal is quan- 
tized and coded, so that the phase amplitude character- 
istic is positively added to the excitation signal. As a 
result, it is possible to synthesize speech of a high quality 
with a good reproducibility of the phase characteristic of 
so the excitation signal. 

In a second aspect of the present invention, there is 
provided a speech decoding apparatus comprising: a lin- 
ear prediction parameter decoding means; an excitation 
signal generating means; a synthesis filter for synthesiz- 
55 ing the output signal of the linear prediction parameter 
decoding means and the excitation signal output from 
the excitation signal generating means; a phase ampli- 
tude characteristic decoding means for decoding a 
coded short-term phase amplitude characteristic; and a 
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phase amplitude characteristic adding filter for adding 
the decoded phase amplitude characteristic to the exci- 
tation signal. 

According to this structure, the coded short-term 
phase amplitude characteristic is decoded, and the 
phase amplitude characteristic is positively added to the 
excitation signal. As a result, it is possible to synthesize 
speech of a high quality with a good reproducibility of the 
phase characteristic of the excitation signal. 

In a third aspect of the present invention, there is 
provided a speech coding and decoding method com- 
prising a coding process and a decoding process: 

the coding process including the steps of: coding 
a linear prediction parameter by the linear prediction 
analysis of an input speech signal; selecting a codevec- 
tor for generating optimum synthesized speech from an 
adaptive codebook and a random codebook; and coding 
and transmitting the excitation signal; and 

the decoding process including the steps of: gen- 
erating an excitation signal and a decoded linear predic- 
tion parameter signal on the basis of the received signal; 
and synthesizing the excitation signal and the decoded 
linear prediction parameter signal by a synthesis filter so 
as to generate an output speech signal. The coding proc- 
ess further includes the steps of: quantizing and coding 
the phase amplitude characteristic which is obtained by 
analyzing the linear prediction residual signal of an input 
speech signal; and adding a short-term phase amplitude 
characteristic to the excitation signal, and the decoding 
process further includes the steps of: decoding the 
coded phase amplitude characteristic; and adding the 
decoded phase amplitude characteristic to the excitation 
signal so as to generate the output speech signal. 

According to this structure, the short-term phase 
amplitude characteristic of an excitation signal is quan- 
tized in the coding process, and the coded phase ampli- 
tude characteristic is decoded in the decoding process, 
so that the phase amplitude characteristic is positively 
added to the excitation signal. As a result, it is possible 
to transmit speech of a high quality with a good repro- 
ducibility of the phase characteristic of the excitation sig- 
nal. 

In a fourth aspect of the present invention, there is 
provided a phase amplitude characteristic extracting 
apparatus for extracting the short-term phase amplitude 
characteristic of a signal, comprising: a phase amplitude 
characteristic codebook which stores a plurality of short- 
term phase amplitude characteristics of signals; a phase 
amplitude characteristic removing filter for removing a 
phase amplitude characteristic; a residual signal gener- 
ating means for generating a residual signal by removing 
the phase amplitude characteristic stored in the phase 
amplitude characteristic codebook from the input signal 
the phase amplitude characteristic removing filter; a 
pulse approximate means or a pulse signal representa- 
tion means for generating a pulse approximated signal 
or a pulse signal representation signal by reducing the 
residual signal to a small number of pulses; a trial signal 
generating means for generating a trial signal by adding 



each removed phase amplitude characteristic to the 
pulse approximated signal; and a selecting and output- 
ting means for selecting the phase amplitude character- 
istic which minimizes the distortion between the trial 
5 signal and the input signal, from the phase amplitude 
characteristic codebook and outputting the selected 
phase amplitude characteristic. 

According to this structure, a residual signal is 
obtained by removing each of the phase amplitude char- 
to acteristics stored in the phase amplitude characteristic 
codebook from an input signal by inverse filters, and 
each residual signal is reduced to a small number of 
pulses. Each of the removed phase amplitude charac- 
teristics is added to the approximate signal, and the 
is phase amplitude characteristic which minimizes the dis- 
tortion between this signal and the input signal is 
selected from the codebook. In this way. the short-term 
phase amplitude characteristic of the signal is obtained. 
As a result, for example, when the short-term phase 
20 amplitude characteristic of the linear prediction residual 
signal of a speech is obtained, it is not necessary to 
extract the pitch period and the pitch position, thereby 
preventing an error in the extraction of the phase ampli- 
tude characteristic. 
25 The above and other objects, features and advan- 
tages of the present invention will become clear from the 
following description of the preferred embodiments 
thereof, taken in conjunction with the accompanying 
drawings. 

30 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a block diagram of the overall structure of a 
first embodiment of the present invention; 
35 Fig. 2 is a block diagram of the overall structure of a 
second embodiment of the present invention; 
Fig. 3 shows an example of excitation vectors con- 
sisting of a pulse train having a pitch period in 
accordance with the present invention; 
40 Fig. 4 shows an example of the excitation vectors 
stored in a pulse random codebook in accordance 
with the present invention; 
Fig. 5 is a block diagram of the structure of an appa- 
ratus for obtaining a short-term phase amplitude 
45 characteristic in a third embodiment of the present 
invention: 

Fig. 6 shows the wave forms explaining an example 
of the generation of a pulse approximated signal in 
the present invention; 
so Fig. 7 is a block diagram of the overall structure of 
an example of a conventional code-excited linear 
prediction speech coding and decoding apparatus; 
Fig. 8 is a block diagram of the overall structure of 
an example of a conventional coding and decoding 
55 apparatus for coding the phase characteristic of an 
excitation signal; 

Fig. 9 is a block diagram of a conventional apparatus 
for obtaining a short-term phase amplitude charac- 
teristic of an excitation signal; and 
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Fig. 10 is an explanatory view of a change in the 
wave form due to a phase amplitude characteristic 
adding filter. 

DESCRIPTION OF THE PREFERRED EMBODI- 
MENTS 

First Embodiment 

A speech coding and decoding apparatus according 
to the present invention will be explained with reference 
to the accompanying drawings. 

Fig. 1 is a block diagram of a first embodiment of a 
speech coding and decoding apparatus according to the 
present invention. The same elements as those shown 
in Fig. 7 are provided with the same reference numerals 
and explanation thereof will be omitted. 

This embodiment is characterized by the following 
newly added elements: phase amplitude characteristic 
analysis means 28 for analyzing a phase amplitude char- 
acteristic, phase amplitude characteristic coding means 
29 for coding a phase amplitude characteristic, phase 
amplitude characteristic adding filters 30, 32 for adding 
a phase amplitude characteristic, and phase amplitude 
characteristic decoding means 31 for decoding phase 
amplitude characteristic. 

In the coding portion 1 , the phase amplitude char- 
acteristic analysis means 28 generates a linear predic- 
tion residual signal by using the input speech 5 and the 
linear prediction parameter which is input from the linear 
prediction parameter coding means 8, obtains the short- 
term phase amplitude characteristic of the linear predic- 
tion residual signal as a filter coefficient by using, for 
example, a conventional method of obtaining the short- 
term phase amplitude characteristic of a linear prediction 
residual signal of speech, and outputs the filter coeffi- 
cient to the phase amplitude characteristic coding means 
29. The phase amplitude characteristic coding means 29 
quantizes the filter coefficient and outputs the corre- 
sponding code to the multiplexing means 3, and the 
quantized filter coefficient to the phase amplitude char- 
acteristic adding filter 30. 

The phase amplitude characteristic adding filter 30 
adds the phase amplitude characteristic by using the 
quantized filter coefficient to the excitation signal which 
is obtained by multiplying the adaptive vector which is 
output from the adaptive codebook 10 by the excitation 
gain p and multiplying the random vector which is output 
from the random codebook 1 1 by the excitation gain y, 
and adding the products, and outputs the thus-obtained 
excitation signal to the synthesis filter 9. The synthesis 
filter 9 generates synthesized speech by using the quan- 
tized linear prediction parameter which is input from the 
linear prediction parameter coding means 8 and the exci- 
tation signal with the phase amplitude characteristic 
added thereto. 

The optimum code searching means 12 evaluates 
the perceptual weighted distortion of a residual signal 
between the synthesized speech and the input speech 



5, obtains the adaptive code L, the random code I and 
the excitation gains p and y which minimize the distortion, 
and outputs the adaptive code L and the random code I 
to the multiplexing means 3 and the excitation gains p 
s and y to the excitation gain coding means 13. The exci- 
tation gain coding means 13 quantizes the excitation 
gains p and y and outputs those codes to the multiplexing 
means 3. 

On the basis of these results, the multiplexing means 
10 3 supplies the code which corresponds to the quantized 
linear prediction parameter, the code which corresponds 
to the quantized filter coefficient of the phase amplitude 
characteristic adding filter 30. and the codes which cor- 
respond to the adaptive code L, the random code I and 
is the excitation gains p and y to a transmission path. 

The above-described operation is characteristic of 
the coding portion 1 of a speech coding and decoding 
apparatus of this embodiment. 

The operation of the decoding portion 2 will now be 
20 explained. 

The separating means 4 which receives the outputs 
from the multiplexing means 3 separates the outputs and 
transmits the supplied adaptive code L to the adaptive 
codebook 1 4. the random code I to the random codebook 
25 1 5. the codes of the excitation gains p and y to the exci- 
tation gain decoding means 1 6. the code of the filter coef- 
ficient of the phase amplitude characteristic adding filter 
30 to the phase amplitude characteristic decoding 
means 31, and the code of the linear prediction param- 
30 eter to the linear prediction parameter decoding means 
17. 

The phase amplitude characteristic decoding 
means 31 decodes the filter coefficient which corre- 
sponds to the code of the filter coefficient of the phase 

35 amplitude characteristic adding filters 30 and outputs the 
decoded filter coefficient to the phase amplitude charac- 
teristic adding filter 32. 

The phase amplitude characteristic adding filter 32 
adds the phase amplitude characteristic obtained using 

40 decoded quantized filter coefficient to the excitation sig- 
nal which is obtained by multiplying the adaptive vector 
which is output from the adaptive codebook 14 by the 
excitation gain p output from the excitation gain decoding 
means 16 and multiplying the random vector which is 

45 output from the random codebook 15 by the excitation 
gain y output from the excitation gain decoding means 
16, and adding the products, and outputs the thus- 
obtained excitation signal to the synthesis filter 18. The 
synthesis filter 18 generates synthesized speech by 

so using the linear prediction parameter which is input from 
the linear prediction parameter decoding means 17 and 
the excitation signal with the phase amplitude character- 
istic added thereto, and outputs the synthesized speech. 
The above-described operation is characteristic of 

55 the decoding portion 2 of a speech coding and decoding 
apparatus of this embodiment. 

According to this embodiment, it is possible to 
enhance the reproducibility of an excitation signal and to 
improve the quality of synthesized speech by coding the 
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short-term phase amplitude characteristic of a linear pre- 
diction residual signal and addling it to the excitation sig- 
nal. 

Second Embodiment 5 

Another embodiment of a speech coding and decod- 
ing apparatus according to the present invention will be 
explained with reference to the accompanying drawings. 

Fig. 2 is a block diagram of a second embodiment 10 
of a speech coding and decoding apparatus according 
to the present invention. The same elements as those 
shown in Fig. 1 are provided with the same reference 
numerals and explanation thereof will be omitted. 

In this embodiment, the following elements are 15 
newly added to the first embodiment: pitch extracting 
means 33 for extracting a pitch period, pitch coding 
means for coding an extracted pitch period, pulse ran- 
dom codebooks 35. 37, and pitch decoding means 36. 

The operation of this embodiment will now be 20 
explained with priority given to the newly added ele- 
ments. 

In the coding portion 1 , the pitch extracting means 
33 extracts the pitch period of the input speech 5 by a 
known method and outputs the extracted pitch period to 25 
the pitch coding means 34. The pitch coding means 34 
quantizes the pitch period and outputs the corresponding 
code to the multiplexing means 3 and the quantized pitch 
period to the pulse random codebook 35. 

The puise random codebook 35 generates a plural- 30 
ity of excitation vectors consisting of a pulse train of the 
quantized pitch period in which, for example, the posi- 
tions of the head pulses are different, and stores them 
as at least a part of the random vectors in the codebook 
35. Fig. 3 shows an example of the excitation vector con- 
sisting of a pulse train of the pitch period, and Fig. 4 
shows an example of the excitation vectors stored in the 
puise random codebook 35. And the puise random code- 
book 35 outputs the random vector which corresponds 
to the random code I input from the optimum code 
searching means 12. 

The phase amplitude characteristic adding filter 30 
adds the phase amplitude characteristic obtained using 
the quantized filter coefficient input from the phase 
ampiitude characteristic coding means 29 to the excita- 
tion signal which is obtained by multiplying the adaptive 
vector which is output from the adaptive codebook 10 by 
the excitation gain p and multiplying the random vector 
which is output from the pulse random codebook 35 by 
the excitation gain y, and adding the products, and out- 
puts the thus-obtained excitation signal to the synthesis 
filter 9. The synthesis filter 9 generates synthesized 
speech by using the quantized linear prediction param- 
eter which is input from the linear prediction parameter 
coding means 8 and the excitation signal with the phase 
amplitude characteristic added thereto. 

The optimum code searching means 12 evaluates 
the perceptual weighted distortion of a residual signal 
between the synthesized speech and the input speech 



5, obtains the adaptive code L, the random code I and 
the excitation gains p and y which minimize the distortion, 
and outputs the adaptive code L and the random code I 
to the multiplexing means 3 and the excitation gains p 
and y to the excitation gain coding means 13. The exci- 
tation gain coding means 13 quantizes the excitation 
gains p and y and outputs those codes to the multiplexing 
means 3. 

On the basis of these results, the multiplexing means 
3 supplies the code which corresponds to the quantized 
linear prediction parameter, the code which corresponds 
to the quantized filter coefficient of the phase amplitude 
characteristic adding filter 30 and the codes which cor- 
respond to the adaptive code L. the quantized pitch 
period, the random code I and the excitation gains p and 
y to a transmission path. 

The schematic structure of the coding portion 1 of 
the second embodiment of the speech coding and 
decoding apparatus has been described above. 

The operation of the decoding portion 2 will now be 
explained. 

The separating means 4 which receives the outputs 
from the multiplexing means 3 separates the outputs and 
transmits the supplied adaptive code L to the adaptive 
codebook 14, the code of the pitch period to the pitch 
decoding means 36, the random code I to the random 
codebook 37, the codes of the excitation gains p and y 
to the excitation gain decoding means 16, the code of 
the filter coefficient of the phase amplitude characteristic 
adding filter 30 to the phase amplitude characteristic 
decoding means 3 1 , and the code of the linear prediction 
parameter to the linear prediction parameter decoding 
means 17. 

The pitch decoding means 36 decodes the pitch 
period which corresponds to the code of the pitch period 
and outputs the decoded pitch period to the pulse ran- 
dom codebook 37. The pulse random codebook 37 
stores the excitation vector consisting of a pulse train of 
the decoded pitch period in the codebook 37 in the same 
way as the random codebook 35. The pulse random 
codebook 37 outputs the random vector which corre- 
sponds to the random code I. 

The phase amplitude characteristic adding filter 32 
adds the phase amplitude characteristic by using the fil- 
ter coefficient input from the phase amplitude character- 
istic decoding means 31 to the excitation signal which is 
obtained by multiplying the adaptive vector which is out- 
put from the adaptive codebook 1 4 by the excitation gain 
p and multiplying the random vector which is output from 
the pulse random codebook 37 by the excitation gain y, 
and adding the products, and outputs the thus-obtained 
excitation signal to the synthesis filter 18. The synthesis 
filter 18 outputs an output speech 6 by using the linear 
prediction parameter which is input from the linear pre- 
diction parameter decoding means 17 and the excitation 
signal with the phase amplitude characteristic added 
thereto. 

As has been described above, according to the sec- 
ond embodiment, a pulse train of a pitch period is used 
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for a random vector, and a phase amplitude characteris- 
tic is added to the random vector. In this manner, it is 
possible to generate an appropriate excitation signal 
from only a random vector. Consequently, even if an 
adaptive vector does not work, it is possible to produce 
an excitation signal with good reproducibility and to 
improve the quality of synthesized speech. 

In this embodiment the pulse train may be obtained 
from an adaptive code. In this case, the pitch extracting 
means 33, the pito- coding means 34 and the pitch 
decoding means 36 in Fig. 2 are eliminated, and the 
pulse interval of the pulse train which is used as a ran- 
dom vector is obtained from the adaptive code. At this 
time, since it is not necessary to transmit the information 
of the pitch period with respect to the pulse interval, it is 
possible to reduce the amount of information transmit- 
ted. In addition, since the reproducibility of an excitation 
signal is good even if the adaptive vector does not work, 
it is possible to improve the quality of synthesized 
speech. 

Third Embodiment 

An embodiment of a phase amplitude characteristic 
extracting apparatus for extracting the short-term phase 
amplitude characteristic of a signal according to the 
present invention will be explained with reference to the 
accompanying drawings. 

Fig. 5 is a block diagram of the structure of an appa- 
ratus for obtaining a phase amplitude characteristic. This 
apparatus is used to obtain the short-term phase ampli- 
tude characteristic of a linear prediction residual signal. 

The following elements are newly added to the con- 
ventional apparatus shown in Fig. 9: a phase amplitude 
characteristic codebook 1 08. a phase amplitude charac- 
teristic removing filter 1 09 for removing the characteristic 
of a phase amplitude, pulse approximate means 1 10 for 
approximating or representing a residual signal by some 
pulses, a phase amplitude characteristic adding filter 1 1 1 
for adding the characteristic of a phase amplitude, a syn- 
thesis filter 112 for synthesizing a speech form a linear 
prediction parameter and an excitation signal, and opti- 
mum phase amplitude characteristic searching means 
1 13 for searching an optimum phase amplitude charac- 
teristic. 

The operation of the apparatus will be explained with 
priority given to the characteristic structure thereof. 

The linear prediction parameter analysis means 1 03 
analyzes input speech 1 01 so as to extract the linear pre- 
diction parameter and outputs the extracted linear pre- 
diction parameter to the linear predictive inverse filter 
104 and the synthesis filter 112. The linear predictive 
inverse filter 104 generates a linear prediction residual 
signal from the input speech 101 by using the linear pre- 
diction parameter, and outputs the linear prediction 
residual signal to the phase amplitude characteristic 
removing filter 109. 

A plurality of phase amplitude characteristics are 
stored in the phase amplitude characteristic codebook 



108 as, for example, filter coefficients, and the phase 
amplitude characteristic codebook 108 outputs the filter 
coefficient of the phase amplitude characteristic which 
corresponds 10 the code input from the optimum phase 
5 amplitude characteristic searching means 113 to the 
phase amplitude characteristic removing filter 109 and 
the phase amplitude characteristic adding filter 111. The 
phase amplitude characteristic removing fitter 109 gen- 
erates a residual signal by removing the phase amplitude 
w characteristic from the linear prediction parameter signal 
by using the filter coefficient, and outputs the residual 
signal to the pulse approximate means 110. The pulse 
approximate means 110 generates a pulse signal repre- 
sentation residual signal by reducing the residual signal 
is to zero except for N samples having the largest ampli- 
tude, for example, and outputs the pulse signal represen- 
tation residual signal to the phase amplitude 
characteristic adding filter 111. 

Fig. 6 shows an example of representation. Fig. 6 
20 shows the process of generating a residual signal from 
a linear prediction residual signal by removing the phase 
amplitude characteristic, and then reducing the residual 
signal to a pulse so as to generate a pulse signal repre- 
sentation residual signal. 
25 The phase amplitude characteristic adding filter 1 1 1 
then adds the phase amplitude characteristic to the pulse 
signal representation residual signal by using the filter 
coefficient so as to produce an excitation signal and out- 
puts the excitation signal to the synthesis filter 112. The 
30 synthesis filter 112 generates synthesized speech by 
using the linear prediction parameter and the excitation 
signal. 

The optimum phase amplitude characteristic 
searching means 1 1 3 evaluates the perceptual weighted 
35 distortion of the residual signal between the synthesized 
speech and the input speech 101, selects the filter coef- 
ficient corresponding to the phase amplitude character- 
istic which minimizes the distortion from the phase 
amplitude characteristic codebook 108, and outputs the 
40 selected filter coefficient as the phase amplitude charac- 
teristic 102. 

According to this embodiment, a codebook which 
stores a plurality of short-term phase amplitude charac- 
teristic of a signal is provided, a trial signal is generated 
45 by using each phase amplitude characteristic in the 
codebook and the phase amplitude characteristic which 
minimizes the distortion between an input signal and the 
trial signal is selected from the codebook. In this manner, 
it is possible to extract the phase amplitude characteristic 
so without an error and without the need for pitch extraction 
or pitch position extraction when the short-term phase 
amplitude characteristic of a linear prediction residual 
signal of speech is obtained. 

While there has been described what are at present 
55 considered to be preferred embodiments of the inven- 
tion, it will be understood that various modifications may 
be made thereto, and it is intended that the appended 
claims cover all such modifications as fall within the true 
spirit and scope of the invention. 
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Claims 

1 . A speech coding apparatus comprising: 

linear prediction parameter analysis means; 

linear prediction parameter coding means; s 

excitation signal generating means; 

a synthesis filter for synthesizing an output 
signal of said linear prediction parameter coding 
means and an excitation signal output from said 
excitation signal generating means; ro 

phase amplitude characteristic coding 
means for quantizing and coding a phase amplitude 
characteristic which is obtained by analyzing a linear 
prediction residual signal of input speech signal; and 

a phase amplitude characteristic adding filter 15 
for adding a short-term phase amplitude character- 
istic to said excitation signal. 

2. A speech coding apparatus according to claim 1, 
wherein said excitation signal generating means 20 
includes: 

an adaptive codebook for outputting an adap- 
tive vector; 

a random codebook for outputting a random 
vector; and 25 

an optimum code searching means for 
searching an optimum excitation; and 

uses a pulse train as said random vector. 

3. A speech coding apparatus according to claim 2. 30 
wherein a pulse interval of said pulse train is 
obtained from an adaptive code. 

4. A speech decoding apparatus comprising : 

linear prediction parameter decoding means; 35 

excitation signal generating means; 

a synthesis filter for synthesizing an output 
signal of said linear prediction parameter decoding 
means and an excitation signal output from said 
excitation signal generating means; *o 

phase amplitude characteristic decoding 
means for decoding a coded short-term phase 
amplitude characteristic; and 

a phase amplitude characteristic adding filter 
for adding the decoded phase amplitude character- 45 
istic to said excitation signal. 

5. A speech decoding apparatus according to claim 4, 
wherein said excitation signal generating means 
includes: so 

an adaptive codebook for outputting an adap- 
tive vector; 

a random codebook fa outputting a random 
vector; and 

excitation gain decoding means; 55 
and uses a pulse train as said random vector. 



6. A speech decoding apparatus according to claim 5, 
wherein a pulse interval of said pulse train is 
obtained from an adaptive code. 

7. A speech coding and decoding method comprising 
a coding process and a decoding process: 

said coding process including the steps of: 

coding a linear prediction parameter by linear 
prediction analysis of an input speech signal; 

quantizing and coding a phase amplitude 
characteristic which is obtained by analyzing a linear 
prediction residual signal of an input speech signal; 

selecting an excitation signal for generating 
optimum synthesized speech from an excitation 
codebook; 

adding a short-term phase amplitude charac- 
teristic to said excitation signal; and 

coding and transmitting said excitation sig- 
nal; and said decoding process including the steps 
of: 

generating an excitation signal and a 
decoded linear prediction parameter signal on the 
basis of a received signal; 

decoding the coded phase amplitude charac- 
teristic; 

adding the decoded phase amplitude charac- 
teristic to said excitation signal; and 

synthesizing said excitation signal and said 
decoded linear prediction parameter signal by a syn- 
thesis filter so as to generate an output speech sig- 
nal. 

8. A phase amplitude characteristic extracting appara- 
tus for extracting a short-term phase amplitude char- 
acteristic of a signal, comprising: 

a phase amplitude characteristic codebook 
which stores a plurality of short-term phase ampli- 
tude characteristics of a signal; 

a phase amplitude characteristic removing fil- 
ter for removing a phase amplitude characteristic; 

residual signal generating means for gener- 
ating a residual signal by removing a phase ampli- 
tude characteristic stored in said phase amplitude 
characteristic codebook from an input signal by said 
phase amplitude characteristic removing filter; 

pulse approximated signal generating means 
for generating a pulse signal representation signal 
by reducing said residual signal to a small number 
of pulses; 

trial signal generating means for generating 
a trial signal by adding each phase amplitude char- 
acteristic removed by said phase amplitude charac- 
teristic removing filter to said pulse signal 
representation signal; and 

selecting and outputting means for selecting 
a phase amplitude characteristic which minimizes a 
distortion between said trial signal and said input 
signal, from said phase amplitude characteristic 
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(57) A speech coding and decoding apparatus for 
improving the quality of synthesized speech. A coding 
portion (1) includes a filter (30) for adding a short-term 
phase amplitude characteristic to an excitation signal, 
and a coding circuit (29) for quantizing and coding a 
phase amplitude characteristic. A decoding^portion (2) 
includes a decoding circuit (31) for decoding the coded 



1 : COOING PORTION 



5: INPUT SPEECH 3 



1<L r 



EM 



m 



mm 

wm_ M 



28 
29 

1 



OBI ~ 

' OOHMMMTB 



11 



B T 
30 



mm 



mm 



KMNftAIETBI 

com mm 

.JsnmcssL 



phase amplitude characteristic, and a filter (32) for add- 
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